Generating Lexicalization Patterns for Linked Open Data

Author

  • Rivindu Perera
Abstract

The concept of Linked Data has attracted increased interest in recent times due to its free and open availability and its sheer volume. We present a framework to generate patterns that can be used to lexicalize Linked Data. We use DBpedia as the Linked Data resource, as it is one of the most comprehensive and fastest-growing Linked Data resources freely available. The framework incorporates a text preparation module that collects and prepares the text, after which Open Information Extraction is employed to extract relations, which are then aligned with triples to identify patterns. The framework also uses the lexical semantic resources VerbNet and WordNet to mine patterns. The framework achieved 70.36% accuracy and a Mean Reciprocal Rank (MRR) of 0.72 for five DBpedia ontology classes, generating 101 lexicalizations.
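The alignment step and the MRR metric mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the exact string matching, the `S?`/`O?` placeholder notation, the function names `align` and `mean_reciprocal_rank`, and the sample triple are all assumptions made for illustration. The sketch takes a relation as an Open Information Extraction system might return it (argument, relation phrase, argument) and turns it into a pattern when its arguments match the subject and object of a triple.

```python
from typing import Optional, Sequence, Tuple

Triple = Tuple[str, str, str]    # (subject, predicate, object)
Relation = Tuple[str, str, str]  # (arg1, relation phrase, arg2), hypothetical Open IE output


def align(relation: Relation, triple: Triple) -> Optional[str]:
    """Return a pattern with S?/O? placeholders (assumed notation) when the
    relation's arguments match the triple's subject and object, else None."""
    arg1, phrase, arg2 = relation
    subj, _pred, obj = triple
    a1, a2 = arg1.strip().lower(), arg2.strip().lower()
    s, o = subj.strip().lower(), obj.strip().lower()
    if a1 == s and a2 == o:
        return f"S? {phrase} O?"
    if a1 == o and a2 == s:
        # Arguments appear in reverse order in the sentence.
        return f"O? {phrase} S?"
    return None


def mean_reciprocal_rank(ranks: Sequence[Optional[int]]) -> float:
    """ranks[i] is the 1-based rank of the first correct pattern for item i,
    or None when no correct pattern was returned (contributes 0)."""
    if not ranks:
        return 0.0
    return sum(1.0 / r for r in ranks if r) / len(ranks)


if __name__ == "__main__":
    # Illustrative data only, not taken from the paper.
    triple = ("Steve Jobs", "birthPlace", "San Francisco")
    relation = ("Steve Jobs", "was born in", "San Francisco")
    print(align(relation, triple))
    print(mean_reciprocal_rank([1, 2, None]))
```

A real system would need fuzzier argument matching (coreference, partial names) than the exact comparison used here; the sketch only shows the shape of the alignment.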

Similar resources

A Multi-strategy Approach for Lexicalizing Linked Open Data

This paper aims at exploiting Linked Data for generating natural text, often referred to as lexicalization. We propose a framework that can generate patterns which can be used to lexicalize Linked Data triples. Linked Data is structured knowledge organized in the form of triples consisting of a subject, a predicate and an object. We use DBpedia as the Linked Data source which is not only free b...

RealText-lex: A Lexicalization Framework for Linked Open Data

Linked Open Data (LOD) is growing rapidly as a source of structured knowledge used in a variety of text processing applications. However, the applications using the LOD need to be able to mediate between the front end user interfaces and LOD. This often requires a natural language interpretation of this structured, linked data. We demonstrate a middle-tier framework that can generate patterns w...

Lexicalizing DBpedia with Realization Enabled Ensemble Architecture: RealText-lex2 Approach

DBpedia encodes massive amounts of open domain knowledge and is growing by accumulating more triples at the same rate as Wikipedia. However, applications often require natural language formulations of these triples to present the information as natural text. The RealText-lex2 framework offers a scalable platform to transform these triples to natural language sentences using lexicalization ...

Multilingual Question Answering over Linked Data (QALD-3): Lab Overview

The third instalment of the open challenge on Question Answering over Linked Data (QALD-3) has been conducted as a half-day lab at CLEF2013. Differently from previous editions of the challenge, QALD-3 put a strong emphasis on multilinguality, offering two tasks: one on multilingual question answering and one on ontology lexicalization. While no submissions were received for the latter, the former...

Talmy’s Dichotomous Typology and Japanese Lexicalization Patterns of Motion Events

Talmy’s (1985) crosslinguistic typology of lexicalization patterns of motion events has been extensively used in second language acquisition (SLA) research as a means to examine how second language (L2) learners map form, meaning, and function. These studies have yielded some conflicting results regarding the learnability of L2 lexicalization patterns  arguably the oversimplification over and...

Publication date: 2015